Searching the Audio Notebook: Keyword Search in Recorded Conversation

نویسندگان

  • Peng Yu
  • Kaijiang Chen
  • Lie Lu
  • Frank Seide
چکیده

MIT’s Audio Notebook added great value to the note-taking process by retaining audio recordings, e.g. during lectures or interviews. The key was to provide users ways to quickly and easily access portions of interest in a recording. Several non-speech-recognition based techniques were employed. In this paper we present a system to search directly the audio recordings by key phrases. We have identified the user requirements as accurate ranking of phrase matches, domain independence, and reasonable response time. We address these requirements by a hybrid word/phoneme search in lattices, and a supporting indexing scheme. We will introduce the ranking criterion, a unified hybrid posterior-lattice representation, and the indexing algorithm for hybrid lattices. We present results for five different recording sets, including meetings, telephone conversations, and interviews. Our results show an average search accuracy of 84%, which is dramatically better than a direct search in speech recognition transcripts (less than 40% search accuracy).

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Can automatic speech recognition be satisficing for audio/video search? Keyword-focused analysis of Hebrew automatic and manual transcription

With massive amounts of academic audio and video content over the web, it is important to assess the performance of state-of-the-art automatic speech recognition (ASR) systems for audio/video navigation through search queries. This paper suggests a novel perspective of the challenges of ASR: instead of minimizing word error rates (WER), focus on keyword recognition. Focusing on keywords may be ...

متن کامل

Supporting Real Estate Search Through Automatic Information Suggestion

Searching real estate property is uncommon task for most of the users. As a result, the user is not familiar with the detailed search condition which is useful for search. In this paper, we propose to use voice recognition as a support for real estate property search. First user set vague search condition with GUI. Then system listens conversation between users. From the conversation, system ex...

متن کامل

Searching for Sounds: A Demonstration of FindSounds.com and FindSounds Palette

FindSounds.com is the first Web search engine for sound effects. In addition to keyword-based retrieval of audio files, FindSounds.com provides a “sounds-like search” for content-based retrieval. FindSounds Palette is a unique software program enabling local and remote audio files to be searched by content and at multiple speeds.

متن کامل

Teachers’ Strategies Used to Foster Teacher-Student and Student-Student Interactions in EFL Conversation Classrooms: A Conversation Analysis Approach

Despite the fact that there are a wide range of strategies used to foster interactions in EFL conversation classrooms, many novice teachers are not aware of them. In view of this problem, the current study aimed to identify such strategies commonly used by EFL teachers in conversation classrooms. To this end, fifty sessions of college level conversation classrooms were observed andtheir teacher...

متن کامل

Search Engine Optimization for Threaded-Conversations

Online discussion communities are becoming increasingly popular among web users, where an extensive amount of discussion and commenting takes place. However, it is difficult to search these conversations as search engines are not optimized for the conversation-structure of online communities. In this paper, we purpose a method of ranking search results based on the conversation-structure and us...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005